wo 2005/080578 



PCT/GB2005/000549 



Desaturase Enzymes 

The invention relates to transgenic cells transformed with nucleic acid molecules 
which encode enzymes with desaturase activity and the use of these cells and 
5 enzymes in biocatalysis. 

Desaturases are enzymes involved in the synthesis of long chain polyunsaturated fatty 
acids (PUFAs). PUFAs are fatty acids (FAs) which are essential to the normal 
functioning of a cell and their nutritional properties are well known. An example of 

10 a PUFA is docosahexanoic acid (DHA). DHA is a n-3 fatty acid that can be obtained 
directly from the diet or derived from metabolism of dietary linoleic and a-linolenic 
acid. The n-3 fatty acids are associated with health promoting properties. For 
example n-3 fatty acids have been described as anti-inflammatory, antithrombotic, 
antiarrhythmic, hypoUpidemic and vasodilatory. As such, the role of DHA in the 

15 prevention and/or treatment of diseases such as coronary heart disease, hypertension, 
type n diabetes, ocular diseases, arthritis, cystic fibrosis and schizophrenia has been 
the focus of a great deal of medical research. 

The production of PUFAs involves a consecutive series of desaturations and 
20 elongations of the fatty acyl chain to generate arachidonic acid (20:4A5,8,11 ,14) and 
docosahexaenoic acid (22:6A4,7,10,13, 16,19). Several desaturases involved in this 
metabolic process have been isolated from marine microalgae, including 
Phaeodactylum tricornutum [5], Euglena gracilis [6] Pavlova lutheri [7]. These 
membrane-bound desaturases are specific with respect to both chain length of the 
25 substrate and the double bond positions on the fatty acid. They belong to the class 
known as front-end fatty acid desaturases due to the fact that they introduce double 
bonds between the carboxy-group and pre-existing bond(s) of the fatty acid [1]. 
These desaturases contain a cytochrome b5 domain at their N-terminus and three 
histidine motifs that are important for catalytic activity [10], 

30 
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Desaturase enzymes and the genes which encode them are known in the art. For 
example, WO03/064596 describes, amongst other things, transgenic cells 
transfomied with omega 3 and delta 12 desaturase nucleic acid molecules and the use 
of these cells in the production of fatty acids. In particular the use of the omega 3 
5 desaturase in the conversion of arachidonic acid to eicosapentaenoic acid and the use 
of the delta 12 desaturase in the conversion of oleic acid to linoleic acid. 
WO03/099216 also describes fungal desaturases and in particular transgenic plants 
modified to express fungal delta 15 desatxurase enzymes. 

10 Furthermore, US2003/0157144 and US2003/0 167525 disclose delta 5 and delta 6 
desaturase genes in the conversion of dihomoylinolenic acid to arachidonic acid and 
linoleic acid to y-linolenic acid respectively. Moreover, US2003/1 34400 discloses 
delta 4 desaturase genes which are involved in the conversion of adrenic acid to (o6- 
docosapentaenoic acid and in the conversion of co3- docosapentaenoic acid to 

15 docosahexaenoic acid. These rare fatty acids are used in pharmacutical and cosmetic 
compositions and can be essential nutritional fatty acids. 

Besides the common FAs 16:0, 16:1A9, 18:0 and 18:1A9 found in most living 
organisms, trace amounts of more unusual fatty acids can be found in a wide range of 

20 species. For instance, presence of 16:1 All has been reported in several species of 
Pavlova^ in the Eustigmatophyte Nannochloropsis oculata^ and in the diatoms 
Phaeodactylum tricornutum and Thalassiosira pseudonana [11,12,13]. This FA 
accounted for a very small portion of the total FAs in these microalgae, and its 
specific role in the algal cells is iinknown. However, this FA is a very important 

25 precursor in the synthesis of sex pheromones in insects. Sex pheromones are species- 
specific blends of xmsaturated fatty acid (UFA) derivatives that differ in terminal 
functional group and in the number, position and configuration (Z or E) of the double 
bond(s), which are produced by various acyl-CoA desaturases [14,15]. Simple 
monoene All UFAs are the most prevalent precursors in the formation of major sex 

30 pheromome components in the modem Lepidoptera [16,17]. For instance, in the com 
earworm Helicoverpa zea^ which produces a pheromone mixture of Zll-16:Ald and 
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Z9-16:Ald in a 30:1 ratio, the most abundant desaturase-encoding transcript is 
HzeaLPAQ (also called HzPGDsl) which encodes a All-desaturase that does not 
possess a cytochrome b5 extension, and therefore requires free cytochrome b5 for 
activity. Many acyl-CoA All-desaturases with different specificities have been 
5 isolated from insects [14,15], but none from other species. We describe the first 
characterisation of a cytochrome b5 desaturase exhibiting Al 1-desaturase activity. 

According to an aspect of the invention there is provided a transgenic cell comprising 
a nucleic acid molecule which comprises a nucleic acid sequence which nucleic acid 
10 molecule consists of the sequences as represented in Figures 5a, 5b, 6a, 6c, 7a, 8a, 
8b, 9a, 10a, 11a, lib, lid or nucleic acid molecules which hybridise to these 
sequences, wherein said nucleic acid molecules encode a polypeptide which has 
desaturase activity. 

15 In a preferred embodiment of the invention said hybridisation conditions are stringent 
hybridisation conditions. 

hi a preferred embodiment of the invention said nucleic acid molecule comprises a 
nucleic acid sequence which has at least 30% homology to the nucleic acid sequence 
20 represented in Figures 5a, 5b, 6a, 6c 7a, 8a, 8b, 9a, 10a, 11a, lib, lid. Preferably 
said homology is at least 40%, 50%, 60%, 70%, 80%, 90%, or at least 99% identity 
with the nucleic acid sequence represented in Figures 5a, 5b, 6a, 7a, 8a, 8b, 9a, 10a, 
1 la, 1 lb and which encodes a polypeptide which has desaturase activity. 

25 The sequence of desaturase nucleic acids may be modified to produce variant 
enzymes with enhanced expression in cells. For example, the addition of a codon 
that encodes an alanine amino acid may facilitate recombinant expression in 
microbial systems e.g. yeast. These modifications may not be required in all 
expression systems but is sometimes desirable. 

30 
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In a preferred embodiment of the invention said nucleic acid molecule comprises the 
nucleic acid sequence as represented in Figures 5a, 5b, 6a, 6c 7a, 8a, 8b, 9a, 10a, 11a, 
1 lb, lid. Preferably said nucleic acid molecule consists of the nucleic acid sequence 
as represented in Figures 5a, 5b, 6a, 7a, 8a, 8b, 9a, 10a, 11a, lib. 

5 

In a further preferred embodiment of the invention said cell over-expresses said 
desaturase encoded by said nucleic acid molecule. 

In a preferred embodiment of the invention said over-expression is at least 2-fold 
10 higher when compared to a non-transformed reference cell of the same species. 

Preferably said over-expression is: at least 3-fold, 4-fold, 5-fold, 6-fold, 7-fold, 8- 
fbld, 9-fold, or at least 10-fold when compared to a non-transformed reference cell of 
the same species. 

15 

In a preferred embodiment of the invention said nucleic acid molecule is a cDNA. 

In yet a further preferred embodiment of the invention said nucleic acid molecule is a 
genomic DNA. 

20 

In a preferred embodiment of the mvention said transgenic cell is transfected with a 
nucleic acid molecule comprisiag a nucleic acid sequence as represented by Figure 
10a and which encodes a desaturase polypeptide wherein said polypeptide has All- 
desaturase activity, or a nucleic acid molecule which hybridises to the nucleic acid 
25 molecule in Figure 10a and encodes a polypeptide with Al 1 -desaturase activity. 

In an alternative preferred embodiment of the invention said transgenic cell is 
transfected with a nucleic acid molecule comprising a nucleic acid sequence as 
represented by Figure 8a and which encodes a desaturase polypeptide wherein said 
30 polypeptide has A6-desaturase activity, or a nucleic acid molecule which hybridises 
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to the nucleic acid molecule in Figure 8a and encodes a polypeptide with A6- 
desaturase activity. 

In a preferred embodiment of the invention said transgenic cell is a eukaryotic cell. 

5 

In an altemative preferred embodiment of the invention said cell is a prokaryotic cell. 

In a further preferred embodiment of the invention said eukaryotic cell is a plant cell. 

10 Plants which include a plant cell according to the invention are also provided as are 
seeds produced by said plants. 

In a preferred embodiment of the invention said plant is selected from: com {Zea 
mays)^ canola {Brassica napus, Brassica rapa ssp.), flax (Linum usitatissimum), 

15 alfalfa (Medicago sativa\ rice (Oryza sativa\ rye (Secale cerale), sorghum (Sorghum 
bicolar. Sorghum vulgare), sunflower (Helianthus annus\ wheat (Tritium aestivum\ 
soybean (Glycine max), tobacco (Nicotiana tabacum), potato (Solatium tuberosum), 
peanuts (Arachis hypogaea), cotton (Gossypium hirsutum), sweet potato (lopmoea 
hatatus)^ cassava (Manihot esculenta), coffee (Cofea spp.), coconut (Cocos nucifera), 

20 pineapple (Anana comosus)^ citris tree (Citruy spp.) cocoa (Theobroma cacao), tea 
(Camellia senensis), banana (Musa spp,), avacado (Persea americana), fig (Ficus 
casiccz), guava (Psidium guajava), mango (Mangifer indica), olive (Olea europaea\ 
papaya (Carica papaya), cashew (Anacardium occidentale), macadamia (Macadamia 
intergrifolia), almond (Prunus amygdalus), sugar beets (Beta vulgaris), oats, barley, 

25 vegetables and omamentals. 

Preferably, plants of the present invention are crop plants (for example, cereals and 
pulses, maize, wheat, potatoes, tapioca, rice, sorghum, millet, cassava, barley, pea), 
and other root, tuber or seed crops. Important seed crops are oil-seed rape, sugar 
30 beet, maize, sunflower, soybean,sorghxmci, and flax (linseed). Horticultural plants to 
which, the present invention may be applied may include lettuce, endive, and 
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vegetable brassicas including cabbage, broccoli, and cauliflower. The present 
invention may be applied in tobacco, cucurbits, carrot, strawberry, sunflower, tomato, 
pepper, 

5 Grain plants that provide seeds of interest include oil-seed plants and leguminous 
plants. Seeds of interest include grain seeds, such as com, wheat, barley, rice, 
sorghum, rye, etc. Oil seed plants include cotton, soybean, safflower, sunflower, 
Brassica, maize, alfalfa, palm, coconut, etc. Leguminous plants include beans and 
peas. Beans include guar, locust bean, fenugreek, soybean, garden beans, cowpea, 
10 mungbean, lima bean, fava been, lentils, chickpea, etc. 

According to a further aspect of the invention there is provided a seed comprising a 
plant cell according to the invention. Preferably said seed is from an oil seed plant. 

15 According to a yet further aspect of the invention there is provided a reaction vessel 
comprising at least one polypeptide according to the invention, fatty acid substrates 
and co-factors wherein said vessel is adapted for the desaturation of said fatty acids 
substrates. 

20 hi a preferred embodiment of the invention said polypeptide is expressed by a cell 
according to the invention. 

Preferably said cell is a eukaryotic cell, for example a yeast cell. 

25 In an altemative preferred embodiment of the invention said cell is a prokaryotic cell. 

According to a further aspect of the invention there is provided a method to 
desaturate a fatty acid substrate comprising the steps of: 

i) providing a reaction vessel according to the invention; and 
30 ii) growing said cells contained in said reaction vessel under conditions 

which allow the desaturation of at least one fatty acid substrate. 
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An embodiment of the inveixtion will now be described by example only and with 
reference to the following tables and figures: 

5 Table 1 illustrates the composition of major fatty acids in T. pseudonana; 

Table 2 illustrates the major fatty acids of pYES and pYDESN yeast transformants 
with and without addition of exogenous saturated fatty acids; 

10 Table 3 illustrates the A6 desaturase activity of TpDESI compared to that of an 
homologous Phaeodactylum tricornutum desaturase; 

Figure 1 illustrates the predicted protein sequences with homology to front-end 
desaturases derived from the T. pseudonana draft genome. Sequence alignments of 

15 12 putative T. pseudonana desaturases with other fimctionally characterised front-end 
desaturase enzymes identified three main blocks of homology that represent the 
ftmctional domains of front-end acyl desaturases (A). The darker shaded box 
highlights the cytochrome b5 haem-binding domain and shaded boxes indicate three 
histidine boxes. See Material and Methods for Genbank accession number and source 

20 species of the functionally characterised enzymes. A phylogenetic tree of nine T, 
pseudonana desaturases with other enzymes was constructed (B). By removing the 
regions containing gaps (ambiguous alignment region), a dataset was created from an 
alignment originally made with clustalX. The tree was constructed from the dataset 
using Phylip3.5c software package and bootstrap analyses were carried out with 1000 

25 replicates. Only well supported nodes (over 70%) are indicated with bootstrap values. 
All branches are drawn to scale as indicated by the scale bar (=0.1 substitutions/site). 
TpDESN sequence is 477 amino acids long (C). The cytochrome b5 haem-hinding 
domain is on a shaded background and the three histidine-boxes are framed; 

30 Figure 2 illustrates RT-PCR expression analysis of TpdesN, Cells were harvested at 
different stages of growth for total RNA extraction and cDNA synthesis (A). PGR 
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was performed on cDNA derived from reverse transcribed RNA using TpdesN and 
18s rRNA specific primer pairs (B). PGR was carried out on undiluted (lane 1) and 
five-fold serial dilutions (lane 2-4) of each cDNA. The 18S rRNA gene was used as a 
control of cDNA synthesis. EE: early exponential phase, LE: late exponential phase, 
5 ES : early stationary phase; 

Figure 3 illustrates GC analysis of FAMEs fi:om yeast transformed with the empty 
plasmid pYES2 or the plasmid containing TpDESN. Invscl yeast strain transformed 
with either pYES2 (A) or pYDESN (B) were induced for three days at 20°C without 
10 supplementation before sampling for fatty acid analysis. I. S. internal standard (17:0). 
The experiment was repeated three times and results of a representative experiment 
are shown; 

Figure 4 illustrates mass spectra of DMDS FAME adducts from pYDESN 
15 transformed yeast. Mass spectrum of the DMDS adduct of 16:1A9 FAME, present in 
all yeast samples (A). Mass spectrum of the DMDS adduct of 16:1 All FAME, which 
was only found in yeast transformed with pYDESN (B). Picolinyl esters with spectra 
characteristic of 16:1A1 1 were also identified in these samples (data not shown); and 

20 Figure 5a is the genomic nucleic acid sequence of the desaturase A from T. 
pseudonana\ Figure 5b is titie cDNA sequence desaturase A; Figure 5c amino acid 
sequence; 

Figure 6a is the genomic nucleic acid sequence of desaturase B from Thalassiosira 
25 pseudonana; Figure 6b is the partial amino acid sequence; Figure 6c is the cDNA 
sequence of desaturase B; and Figure 6d is the amino acid sequence of said cDNA 
sequence; 

Figure 7a is the nucleic acid sequence of desaturase E from Thalassiosira 
30 psuedonana; Figure 7b is the amino acid sequence; 
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Figvtre 8a is the nucleic acid sequence of desaturase I jfrom Thalassiosira 
pseudonana; Figure 8b is the cDNA sequence; and Figure 8c is the amino acid 
sequence; 

5 Figure 9a is the nucleic acid sequence of desaturase K j&om Thalassiosira 
pseudonana; Figure 9b is liie amino acid sequence; 

Figure 10a is the nucleic acid sequence of desaturase N from Thalassiosira 
pseudonana; 

10 

Figure 11a is the nucleic acid sequence of desaturase O from Thalassiosira 
pseudonana; Figure lib is the cDNA sequence; Figure 11c is the amino acid 
sequence; Figure lid is the nucleic acid seqiaence of desaturase O variant sequence 
from Thalassiosira pseudonana; and Figure lie is the amino acid sequence of said 
1 5 variant desaturase O; 

Figure 12A and 12B is a GC analysis of FAMEs from yeast expressing TpDESI with 
exogenous substrates 18:2A9,12 (A) and 18:3 A942,15 (B). New FAs produced from 
endogenous and exogenous substrates are underlined; Figure 12C is a GC analysis of 
20 FAMEs from yeast transformed with a vector only control compared to yeast 
transformed with TpDESI; 

Figure 13 is an illustration of fatty acid synthesis pathways; and 
25 Figure 14 is a GC analysis of FAMEs from yeast expressing TpDESO. 
Materials and Methods 

Ideatification of putative Thalassiosira pseudonana desaturase-coding 
30 sequences and phvlogenetic analysis with other functionally characterised 
desaturases 
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The draft genome of the diatom T. pseudonana has been sequenced to approximately 
nine times coverage by the whole genome shotgun method. Sequence data were 
produced by the US Department of Energy Joint Genome histitute 
5 (http ://www.i gi.doe> govA and the raw sequence data were downloaded and installed 
on a local server. Batch tblastn searches were carried out using protein sequences of 
the following 13 known desaturases as query, including PlDESl (AY332747, 
Pavlova lutheri), TFAD4 (AF489589, Thraustochytrium sp, ATCC 21685), TFAD5 
(AF489588, Thraustochytrium sp. ATCC 21685), PtDEL5 (AY082392, 

10 Phaeodactylum tricornutum), PtDEL6 (AY082393, Phaeodactylum tricornutum), 
EgDELS (AF139720, Euglena gracilis), EgE)EL4 (AY278558, Euglena gracilis), 
ZfDEL (AF309556, Danio rerio\ B0DEL6 (U79010, Borago officinalis), HsDELS 
(AF084558, Homo sapiens), HsDEL6 (AF084559, Homo sapiens), CeDEL6 
(AF031477, Caenorhabditis elegans) and CeDEL5 (AF078796, Caenorhabditis 

15 elegans). 

All non-redundant sequences with an E value less than 0.001 were retrieved and 
assembled into contigs using the CAP3 sequence assembly program [18]. The contigs 
were translated into amino sequences in three frames in the orientation indicated by 
20 tblastn result. Putative desaturase gene models were constructed manually based on 
sequence homology and in frame GT-AG intron boundaries were identified. 

Deduced amino acid sequences of all 12 putative desaturase sequences of T. 
pseudonana were aligned with the above 13 functionally characterised desaturases 
25 from other species, using ClustalX version. 1.8 [19]. The alignment was then 
reconciled and ftirther adjusted. Only nine near frill-length Thalassiosira sequences 
were retained for ftirther analyses. 

A dataset of 250 conserved residue positions was used for constmction of the 
30 phylogenetic tree. Distance analysis used the program protdist of the Phylip 3.5c 
package with a PAM250 substitution matrix and a tree was then built from the matrix 
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using fitch (Fitch-Margoliash method). Bootstrap analyses were carried out with 
1000 replicates using the neighboiur-joining algorithm. 

Cultivation of T. vseudonana 

5 

T, pseudonana (CCAP 1085/12) was obtained from the Culture Collection of Algae 
and Protozoa (Dunstafftiage Marine Lab., Oban, PA34 4AD, Scotland, U.K.). The 
growth medium used was enriched artificial seawater medium (EASW), made up in 
20 1 batches as described previously [4]. The cultures were grown in one litre flasks 
10 at 15°C with 50 jiE m'^ s"^ constant illumination, and aeration provided by shaking 
the flasks at 150 rpm. 

Cell density was monitored by cormting cells with a haemocytometer. Nitrate 
concentration was determined periodically during the culture time by measuring the 
15 change of the medium absorbance at 220 nm [20] . 

RNA extraction, cDNA svntliesis and RT-PCR analysis 

Total RNA was extracted from frozen cells harvested at different stages of growth 
20 with an RNeasy plant inini kit (Qiagen). First strand cDNA was synthesised from 
three ptg of DNAse treated RNA using a Prostar First-strand RT-PCR kit 
(Stratagene). PGR was performed using undiluted and jSve-fold dilutions of cDNAs 
as followed: the reactions were heated to 95 "^C for 5 min followed by 35 cycles at 95 
^C for 30 s, 50 ^'C or 65*^C (for 18S rRNA and 2>^/ejJV^ respectively) for 30 s and 72 °C 
25 for 2 min, then a single 72 **C for 10 min. As a marker for constitutive expression, the 
18S rRNA gene was amplified with the primer TH18S5' (5'- 
GGTAACGAATTGTTAG-3') and TH18S3' (5'-GTCGGCATAGTTTATG-30. 
TpdesN cDNA was amplified using primers DESNR2 (5'- 
GTGAGAGCACTAACCAAGCTT-3') and DESN2 (5'- 

30 CAATCAGTAGGCTTCGTC G-3'). Aliquots of PGR reaction were electrophoresed 
through a 1% agarose gel. Identity of the diagnostic fragment amplified with TpdesN 
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Specific primers was verified by sequencing after cloning in the pGEM-T EasyVector 
(Promega). 

Functional characterisation of TpDESI in yeast 

5 

The entire TpdesI coding region was amplified firom Zl pseudonana cDNA with 
primers DesINB 5'- GC GGGATCC A CC ATGGCKjGAAAAGGAGGAGAC-3 ' 
(ORF start codon is indicated by bold type; imderlined sequence is a BamHl site; 
italic sequence is an added alanine codon, not present in the original sequence of 

10 Pldesl) and DesICE 5'-GCGAATTCTTACATGGCAGGGAAATC-3' (ORF stop 
codon is indicated in bold type; imderlined sequence is a EcdRl site). The Expand 
High Fidelity PGR system (Roche) was employed to minimise potential PGR errors. 
The amplified product was gel purified, restricted and cloned into the corresponding 
sites behind the galactose-inducible GALl promoter of pYES2 (Invitrogen) to yield 

15 liie plasmid pYDESl. This vector was transformed into S, cerevisiae strain Ihvscl 
(Invitrogen) by a lithium acetate method, and transformants were selected on 
minimal medium plates lacking uracil. 

For functional expression, cultures were grown at 25°C in the presence of 2% (w/v) 
20 raffinose and 1% (w/v) Tergitol NP-40 (Sigma). Expression of the transgene was 
induced when ODeoonm reached 0.2-0.3 by supplementing galactose to 2% (w/v). At 
that time, the appropriate fatty acids were added to a final concentration of 50 jU-M. 
Incubation was carried out at 25°C for three days. 

25 Functional cliaracterisation of TpDESN in yeast 

Genomic DNA firom T, pseudonana cells was extracted using the DNA isolation kit 
Puregene (Gentra Systems) and 100 ng was used to amplify the entire TpdesNcodmg 
region with primers DesNNB 5'- 

30 GCGGGATCCACCATGGCrGACTTTCTCTCCGGC-3' (ORF start codon is 
indicated by bold type; underlined sequence is a BamHl site; italic sequence is an 
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added alanine codon, not present in the original sequence of TpdesN) and DesNCE 
5'-GCGAATTCTCAATCAGTAGGCTTCGT--3' (ORF stop codon is indicated in 
bold type; underlined sequence is a EcoRl site). The Expand High Fidelity PCR 
system (Roche) was employed to minimise potential PCR errors. The ampUfied 
5 product was gel purified, restricted with EcoRl and BamHl and cloned into the 
corresponding sites behind the galactose-inducible GALl promoter of pYES2 
(Invitrogen) to yield the plasmid pYDESN. The fideUty of the cloned PCR product 
was checked by sequencing. The vector pYDESN was then transformed into S, 
cerevisiae strain Invscl (hivitrogen) by a lithium acetate method, and transformants 
10 were selected on minimal medium plates lacking uracil. 

For the feeding experiment with PUFAs, cultures were grown at 22°C in the presence 
of 2% (w/v) raffmose and 1% (w/v) Tergitol NP-40 (Sigma). Expression of the 
transgene was induced when ODeoonm reached 0.2-0.3 by supplementing galactose to 

15 2% (w/v). At that time, the appropriate fatty acids were added to a final concentration 
of 50 fiM. Incubation was carried out at 22°C for three days and then 15''C for 
another three days. For the feeding experiment with saturated fatty acids, a single 
Invscl colony transformed with pYES2 (empty plasmid, control) or pYDESN -was 
inoculated in 10 ml of minimal media minus uracil containing 2% raffmose and 

20 grown overnight at 30°C with shaking (300 rpm). After 16-24 h, cells were collected 
by spinning at 4500 rpm for 10 min. After discarding the supematant, the cell pellet 
was resuspended in the same medium mentioned above supplemented with 2% 
galactose and 1% tergitol, to obtain a cell density of 5x10'^ cells/ml. Fifteen ml of this 
cell suspension were added to a 100 ml-flask with or without addition of saturated 

25 fatty acids (as mentioned in the text) at 500 fxM final concentration. Desaturase 
induction was then carried out at 20^C with shaking (300 rpm) for three days. 

Fatty acid analysis 

30 Microalgae or yeast cells were harvested by centrifiigation. Total fatty acids were 
extracted and transmethylated as previously described [4]. Most FAMEs were 
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identified by comparison of retention times to a 37 FAMIE mix (Supelco). PUFA 
FAMES were also identified by comparison to a sample orf standard Menhaden oil 
(Supelco) transmethylated as per the samples. 

Dimethyl disulphide (DMDS) adducts were used to determine the double bond 
position in identified and unidentified monounsaturated FAJVEEs. These were made 
by adding together 50 inl DMDS (Aldrich), 100-1000 ng F^AMEs dissolved in 50 ^il 
hexane, and 5 ^1 50 mg mf^ iodine in diethyl ether. This solution was heated at 40°C 
for 15 h and partitioned with 200 \il hexane and lOO \xl 5% (w/v) sodium 
thiosulphate. The hexane phase was removed, dried under vacuum, reconstituted in 
50 fxl fresh hexane and used for GC-MS analysis, A Trace GC 2000 (ThermoQuest) 
fitted with a 30 m x 0.25 mm x 0.5 |Lim film thickness ZB-1 column (Phenomenex) 
was used to chromatograph 2 \il DMDS adducts injected at 250°C and a 50:1 split 
ratio with He as carrier gas at 0.6 ml min ^ in constant flow xnode. The oven program 
was 120°C for Imin then to 340°C at 5°C min"\ Mass spectra were obtained using a 
GCQ (ThemioQuest) mass spectrometer operating in full scan mode over 50-500 
m/z. Picolinyl esters were also made fi-om FAMEs to confixm their identities. These 
were obtained by adding 15 jil freshly prepared 2:1 (v/v) 3- <hydroxymethyl)-pyridine 
(Aldrich): potassium tert butoxide 1 M solution in tetrahydroftiran (Aldrich) to 50 ill 
FAMES dissolved in dichloromethane. This solution was hieated at 40°C for 30 min 
and partitioned with 200 |li1 hexane and 100 |uil 2.5% (w/v) sodium hydrogen 
carbonate. The hexane phase was removed, dried under va<3uum and reconstituted in 
50 ^1 fresh hexane. Picolinyl esters were injected and separated by GC-MS using the 
same conditions as for DMDS adducts; Sperling P., Zaliringer U. and Heinz E. 
(1998) A sphingolipid desaturase from hi^er plants. J. Biol. Chem. 273, 28590- 
28596; Sperling P., Libisch B., Zahringer U., Napier J.A. and Heinz E. (2001) 
Fimctional identification of a D8-sphingolipid desaturase from Borago officinalis. 
Arch. Biochem. Biophys. 388, 293-298; Whitney H.M., Michaelson, L.V., 
Sayanova, O., Pickett J. A. and Napier, J A. (2003) Functional characterization of two 
two cytochrome b5-fusion desaturases from Anemone ieveillei: The unexpected 
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identification of a fatty acid A6-desaturase. Planta 217, 983-992; each of which are 
incorporated by reference. 

EXAMPLE 1 

Identification and phylogenetic analysis of putative T, pseudonana desaturase 
sequences with other functionally characterised desaturases 

Tblastn searches with 13 functionally characterised desaturases revealed 427 non- 
redundant raw sequences with E values less than 0.001. Twelve unique contigs were 
assembled after retrieving these sequences and gene models were constructed 
manually based on sequence homology. These 12 gene contigs were arbitrarily 
designated TpdesA to TpdesL, All 12 showed significant sequence similarity to query 
sequences with 9 containing near full length open reading frames compared to other 
known desaturases (Fig. lA). Interestingly, the predicted amino acid sequence of all 
nine T. pseudonana desaturases have a characteristic fused cytochrome bS haem- 
binding domain (HP[G/A]G) at their N-terminus and three histidine boxes (H[X]3- 
4H, H[X]2-3HH AND Q[X]2-3HH) with the replacement of the first histidine by 
glutamine in the third histidine box in all but two of the predicted proteins (TpDESA 
and TpDESB). These are common characteristics of a large subgroup of front-end 
acyl group desaturases [21]. These histidine-box motifs are critical for desaturase 
activity, most likely because they serve to coordinate the diiron-oxo component of 
the active site. Three remaining sequences (TpDESD, TpDESL and TpDESH) appear 
to be partial, covering only the C~terminal end of desaturases, but nevertheless they 
do contain a typical third histidine box of the above mentioned subgroup of 
desaturases (Fig. lA). 

In order to gain insight into the relationships of these T. pseudonana sequences to 
other functionally characterised desaturases and especially algal desaturases, we 
constructed an unrooted phylogenetic tree using a Fitch-Margoliash method with 
statistical confidence measured by bootstrap analysis (Fig. IB). Relationships of four 
putative T. pseudonana desaturases are in well supported (>70% bootstrap value) 
subgroups witihi at least one functionally characterised desaturase firom other species. 
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Both TpDESM and TpDESO grouped with PtDELS, a A5-desaturase from another 
diatom, P. tricornutum [5], suggesting these two enzymes may also have a A5- 
desaturase activity. Similarly TpDESK is grouped with two A4-desaturases TFAD4 
and EgDEL4 from Tltraustochytrium sp, ATCC21685 [22] and E. gracilis 
5 respectively. TpDESI grouped with PtDEL6, a A6-desaturase from P. tricornutum. 
This indicates that TpDESK and TpDESI may have A4 and A6-desaturase activities 
respectively. However, as enzymes with different regioselectivities are also found in 
a well supported subgroup (EgDELS, CeDELS and CeDEL6; A8, A5 and A6- 
desaturase respectively) and regioselectivity may even derive independently after a 
10 more recent dupUcation (CeDELS and CeDEL6) [23] predictions based on homology 
can be misleading and it is essential to functionally characterise each enzyme. 

The remaining five T. pseudonana sequences fall into three separate subgroups 
(TpDESE; TpDESA and TpDESB; TpDESG and TpDESN) which do not group with 
15 any other known fimctional desaturases with high confidence. It is therefore possible 
that these proteins exhibit novel regioselectivity. The current study focussed on the 
characterisation of one of these proteins, TpDESN. 

EXAMPLE 2 

20 

Temporal expression of TpDESN gene 

RT-PCR analysis of TpdesN transcript was conducted at different stages of algal 
growth in order to establish if and when this gene is expressed. After RNA extraction 

25 and cDNA synthesis, TpdesN specific PGR products were amplified. PGR 
amplification of the 18S rDNA gene was performed as a control for the quantity of 
cDNA used during PGR reactions. Figure 2 shows that the diagnostic 519 bp cDNA 
amplification product expected for TpdesN was present at similar level at the 
different stages of cultivation of the microalga cells. Thus, TpdesN is 

30 transcriptionally active at a constitutive level during Thalassiosira growth, 
suggesting that it may encode a desaturase with a housekeeping fimction. 

16 
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EXAMPLES 

Functionai characterisation of TpDESN in yeast 

5 

The putative desatxirase sequence annotated TpdesN was contained on a genomic 
DNA contig of 2580 bp on which no introns was detected. To estabUsh the function 
of the protein encoded by this gene, the full-length sequence was amphfied from 
genomic DNA. An alanine codon containing a G as the first letter was added 

10 immediately downstream of the start codon of TpdesN to ensure optimal translation 
in yeast [24]. The TpdesN ORF is 1434 bp long, and encodes a 477 amino acid 
protein TpDESN (Fig. IC), having a molecular weight of 53.8 kDa. Analysis of the 
secondary structure of TpDESN using SOSUI software 
(http://sosui.proteome.bio.tuat.ac.jp/sosuiframeO.html) [25] predicted four 

15 transmembrane regions (not shown). Alignment of TpDESN with functionally 
characterised desaturase sequences mentioned above indicated an overall identity of 
25%, with the cytochrome Z?5-like domain and the three conserved histidine-rich 
motif areas showing greatest homology. 

20 The primary sequence of TpDESN exhibited typical features of front-end desaturases 
involved in PUFA synthesis. In order to characterise flie specificity of this protein, 
PUFAs (18:2A9,12; 20:2A11,14; 20:3A841,14; 22:4A5,8,lia4; 18:3A9,12a5; 
20:3A1 1,14,17; 20:4A8,1 1,14,17; 22:5A7,10,13,16,19) where first fed to the host 
yeast transformed with pYDESN and the vector alone (pYES2) as a control. 

25 Unexpected!}^, after six days of incubation, TpDESN did not desaturate any of the 
supplemented PUFA substrate. Furthermore, there did not appear to be any 
production of 18:2A9,12 from endogenous 18:1A9. However, a significant increase 
was observed for a peak eluting in the range of sixteen carbon monounsaturated 
FAMES in the yeast transformed with pYDESN (Fig. 3). The position of the double 

30 bond in this product was determined by GC-MS analysis of FAMEs derived to 
DMDS adducts [26] and picolinyl esters. The DMDS adduct of 16:1A9 FAME 
yielded two major fragments at m/z 145 and 217 (Fig. 4A). Fragmentation of the 
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increased FAME peak found in unfed or fed yeast transformed with pYDESN 
produced two diagnostic fragments at /n/z 117 and 245 (Fig. 4B). This fragmentation 
pattern was indicative of an All monounsaturated sixteen carbon FAME, 16:1 All, 
suggesting that TpDESN encoded a new All-desaturase. Small amounts of this FA 
have also been measured in Thalassiosira cells (Table 1). To further substantiate 
these results, yeast transformed with pYDESN and the control empty vector, pYES2, 
were cultivated in medium supplemented with saturated FA (14:0; 16:0; 18:0) 
representing potential substrates for the synthesis of the monoxmsaturated product. 
Yeast fatty acid profiles were analysed after three days of incubation at 20°C. Results 
in Table 2 showed that a small amount of 16:1 Al 1 (0.23% of total FAs) was detected 
in yeast transformed with pYES2, suggesting endogenous synthesis of this FA from 
16:0. This FA accumulated at a higher level in both types of transformed yeast after 
feeding with 14:0, with values up to 5.84% in pYDESN transformants. A possible 
explanation for this increase in the pYES2 transformants is that the endogenous yeast 
A9-desaturase was able to use additional 14:0 to produce 14:1A9 that was 
subsequently elongated to 16:1A11. Moreover, it has been reported that wild type 
yeast cells cultivated in media supplemented with 14:1A9 synthesised 16:1 All by 
Elolp-dependent carboxy terminal elongation [27], After 18:0 supplementation, the 
percentage of 16:1 All, of about 6% total FAs, was similar to that observed after 
feeding with 16:0. Presence of extra 18:0 could lead to an inhibition of the 16:0 
chain elongation system, which might allow more 16:0 to be available for All- 
desaturation. On the other hand, 18:1 All represents 1.2% of the total FAs in 
transgenic yeast. No variation in its proportion was monitored under the different 
conditions of incubation, even after supplementation with 18:0 in pYDESN 
transformants. This suggests that this FA originates from elongation of 16:1A9 rather 
than Al 1-desaturation of 18:0. 

EXAMPLE 4 

Functional characterisation of TpDESI in yeast 

To estabUsh the fimction of TpDESI, the fiiU-length cDNA was expressed in the 
yeast Invscl under the control of an inducible galactose promoter. Potential 

18 
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substrates of front-end desaturases (18:2A9,12; 18:3A9,12,15; 20:3A8,11,14; 
20:4A84 1,14,17; 22:4A7,1043,16; 22:5A7,10,13 ,16,19) were tested. Figure 12A and 
12B show that after supplementation of the medium with 18:2A^'^^ and 18:3A^'^^'^^ 
respectively, and after three days of incubation, yeast cells containing pYDESI had 
extra fatty acids. Extra peaks observed when cells were fed with 18:2A9,12 had a 
retention time identical to 16:2A6,9, 18:2A6,12and 18:3A6,9,12 (Figure 12A). Extra 
peaks observed when cells were fed with 18:3A9,12,15 had a retention time identical 
to 16:2A6,9, 18:2A6,12 and 18:4A6,9,12,15 (Figure 12B). These results demonstrate 
that TpdesI encodes a A6-desaturase which can introduce double bond in 
exogenously fed 18:2A9,12 and 18:3A9,12,15 fatty acids, but also in endogenous 
16:1A9 and 18:1A9 fatty acids. Percentages of conversion of these different 
substrates are given in Table 3. 

Fatty acid profiling of marine microalgae had shown that T pseudonana represents a 
good candidate to discover genes involved in the production and storage of PUFAs 
[4], Analysis of the recently completed draft genome of this microalga revealed the 
presence of many candidate genes for elongase and desaturase activities most 
probably involved in catalysing different steps of the PUFA biosynthetic process. We 
have identified 12 possible desaturase genes, 9 of which there is sufficient sequence 
information to demonstrate that they exhibit typical features of front-end desaturases, 
i,e. a cytochrome b5 domain in the N-terminus and three histidine clusters located at 
highly conserved regions. Phylogenetic analysis revealed that several of the genes are 
closely related to a number of previously characterised front-end desaturases 
involved in PUFA synthesis. However, the current work highlights the fact that 
desaturase function, in terms of regioselectivity, cannot solely be based on prediction 
from primary amino acid sequence homology. 

The fatty acid profile of T, pseudonana cells is quite diverse (Table 1), with the 
health beneficial EPA (20:5A5,8,1 1,14,17) and DHA (22:6A4,7,10,13,16,19) 
accounting for a large proportion. However, the number of desaturase gene sequences 
found in the genome was higher than we expected based on the number of different 
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desaturation reactions required to produce the diversity of FA in this microalga. This 
suggested that non-obvious desaturation reactions might also occur in the 
Thalassiossira cells. As a first step to establishing function of the many putative 
desaturase sequences, we focused on the TpdesN contig due to the fact that the 
sequence was full-length and intronless. A temporal expression study showed that 
TpdesN was constitutively transcribed during algal cultivation. Expression of the 
TpdesN ORF in yeast supplemented with PUFAs as potential substrates for 
desaturation revealed no new products. There was also no evidence of activity with 
the endogenous 18:1A9 which excludes the possibility that TpDESN acts as a A12- 
desaturase. However, an increase in the peak area of a FAME eluting in the range of 
the sixteen carbon FAMEs was identified and GC-MS based analysis revealed this to 
be 16:1 All fatty acid. Small amoimts of this FA are also present in wild type yeast. 
However, quantitative comparison of FA levels in the empty vector pYES2 and 
pYDESN transformants showed that proportions of 16:1 All increased in the 
presence of TpdesN in both unfed cells and cells that had been fed different saturated 
FA. No other changes in either peak area or new peaks were detected in pYDESN 
transformants, indicating that TpDESN is specifically involved in conversion of 16:0 
to 16:1A11. 

The presence of small amoxmts of 16:1 All have previously been reported in many 
microalgae, including T pseudonana. However, a function for this FA in algal cells 
has not been established. The low quantity observed in many marine microalgae 
suggests that it may act as an intermediate in an as yet unidentified biosynthetic 
pathway, hi insect cells, 16:1 All represents an important precursor for pheromone 
synthesis, where it is produced by an acyl-CoA All-desaturase. Interestingly, the 
insect All-desaturases do not possess a cytochrome b5 domain in their N-terminal 
region. This represents a major primary structure difference compared with TpDESN. 
The cytochrome b5 domain is not a determinant of the substrate specificity [28]. 
Alignment of the desaturase domain of TpDESN with the full sequence of insect 
All-desaturases showed an identity of 20% (data not shown). In uisect cells, All- 
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desaturases are more or less specific depending on the origin of the sequence and 
well-documented reviews exist on this subject [14,15], 

In conclusion therefore, although the TpDESN primary sequence is very similar to 
5 front-end desaturases, it should not be considered a member of this family of 
desaturases because it acts only on 16:0. Identification of such a novel enzyme 
expands the fimctional repertoire of the membrane-bound desaturases and it should 
provide useful comparative information for understanding phylogenetic relationships 
between these enzymes. One question that remains to be answered regards whether 

10 cytochrome b5 was independently fused to desaturases that had aheady acquired their 
different specificities, or whether an ancestral fusion protein for proximal lipid 
modification duplicated and subsequently evolved into different desaturases. Studies 
of the primary structure of the different PUFA desaturases support the fact that 
enzyme conversion (i.e. change of specificity) can be achieved through a relatively 

15 few structural changes [29]. The high degree of homology between the many 
potential firont-end desaturases identified in the genome of T, pseudonana support 
this notion. Given the FA profile of T. pseudonana cells and the complexity of the 
desaturase gene family it is likely that different genes will encode A4, A5 and A6 
desaturases. It will now be very interesting to functionally characterise these 

20 remaining putative desaturase genes and study the relationship between 
regioselectivity, primary amino acid sequence and phylogenetic relationship. A 
crystal structure for these enzymes is still not available due to technical difficulties in 
obtaining sufficient quantities of purified membrane-boxmd protein. Molecular 
genetic approaches involving site-directed mutagenesis have provided new insight 

25 mto structure-function relationships, including for example that residues in close 
proximity to the histidine motifs have been found to be involved in shifting the ratio 
of desaturation/hydroxylation activities [30]. Detailed comparative analyses and 
computer modeling of these diverse desaturases from T. pseudonana may further 
guide site-directed mutagenesis studies aimed at defining key residues controlling 

30 substrate specificity and regioselectivity of the introduced double bond. 
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